Attorney Docket No. 23087-703 



SYSTEM AND METHOD FOR EDITING DIGITAL IMAGES 
USING INDUCTIVE IMAGE GENERATION WITH CACHED STATE-SPECIFIC 
IMAGE TILES 



Inventor(s): 

Richard T. Minner 

2635 Napoli Court 

Carmichael, CA 95608 

A citizen of the United States of America 

Mark Boenke 

653 Tupper St. 

Santa Rosa, CA 95404 

A citizen of the United States of America 

Richard Walker 
631 O'Farrell St. 
Apt. 1701 

San Francisco, CA 94109 

A citizen of the United States of America 



Assignee Xippix, Inc. 

125 E. Sir Francis Drake Boulevard 
Larkspur, California 94939 



Entity: 



McCutchen, Doyle, Brown & Enersen 
Three Embarcadero, Suite 2500 
San Francisco, CA 94111-4067 
(415) 393-2000 



1 



BACKGROUND OF THE INVENTION 

This invention relates in general to editing digital images over a network, 
and, in particular, over the Internet. 

Computer systems can hold digital images that are far larger than they can 
display. A typical computer display has a resolution of approximately 1,000 rows and 
1,000 columns, for a total of about 1,000,000 pixels. The computer's hard drive may store 
images destined eventually for printing at resolutions of thousands of pixels per inch, and 
containing hundreds of milhons or bilhons of pixels. 

Given the inherent mismatch between the size of an image a computer can 
store and the size of the image it can display, the process of interactively editing high- 
resolution digital images on a computer is necessarily one in which the image the user 
sees is a proxy for the image she is actually editing. 

A proxy image is created by downsampling or windowing. 

Downsampling comprises supplementing an original image by a new 
lower-resolution image in which each pixel is a weighted average of (or a sample from) a 
square of pixels from the original image. A downsampled image is a lower-resolution 
representation of the same scene shown in the original image. 

Windowing comprises selecting a rectangular subset of pixels from a 
larger rectangular image, with the rectangular window parallel to the orientation of the 
larger rectangle. The windowed image is at the same resolution as the original image, but 
shows only a fraction of the scene shown in the original image. 

The key to making a fast digital-image editing system is to take advantage 
of the low resolution of the computer display and of the proxy images displayed on it. 
The number of pixel fransformations produced by the system should be on the order of 
magnitude of the number of pixels delivered to the computer's display screen, with 
processing of the high-resolution image deferred until the user chooses to save it. 
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A digital-image editing system is needed which has the rapid image- 
transformation speed made possible by operating on low-resolution proxy images, but 
which still makes it possible for the user at any time to zoom into an window of the 
archival image or of an intermediate-resolution image in order to view the edited version 
of the higher-resolution image. 

The system preferably permits the user to back out of decisions, and revert 
to an earher state of the editing session. I.e., the system should permit the user to undo 
any sequence of editing operations. 

The system preferably permits the user to enter edit instructions and to 
view proxy images on a client computer remote from the server computer on which 
archival images reside and on which the pixel-transforming edit operations are carried 
out. I.e., it should be a networked system, suitable for use on local or wide-area networks, 
and, in particular, on the Internet, using the protocols of the World Wide Web. 

The system desirably does all of the above cost-effectively. In particular, 
the preferred mode is for the system to remain fast and flexible on computers with limited 
processing speed or with limited amounts of random access memory (RAM) relative to 
the image editing task. 



SUMMARY OF THE INVENTION 

The preferred embodiments of the invention are a system and method 
using inductive image generation with cached, state-specific image tiles for editing digital 
images. A computer system contains an archival digital image to be edited and viewed, a 
transformation state list, describing editing operations to be applied to the archival image 
in order to produce the current edited-image rendition, and a viewing data set, describing 
the resolution, offset and extent of the current view of the current edited-image rendition. 
The current view is constructed from a set of image tiles kept in a tile cache. In response 
to an instruction to generate the current view, the system identifies the requisite tiles, and 
then generates each tile by an inductive image generation process. The latter process 
comprises (a) ascertaining if the tile is already in the cache, and, if not (b) ascertaining if 
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the image is unedited and if so obtaining the requisite tile from the image file, and if not 
(c) generating the tile by identifying and assembling the requisite supplier tiles in the 
previous state of the image-editing process and by then applying the current state's 
editing transformation function to the supplier tiles. The inductive image-generation 
5 process is recursive in that in case it is necessary to assemble supplier tiles in the course 
of the inductive image-generation process then they too are assembled by the process of 
inductive image generation. 

The use of inductive image generation with cached state-specific image 
tiles lets the system build on earlier work without forcing the system to hoard earlier 
work: tiles are cached for later use, but the system does not require an exhaustive cache of 
earlier-constructed tiles; tiles may be purged firom the cache as required; once a tile is 
purged firom the cache it is really gone from memory; there is no residual stub or place- 
holder. The system also constructs image tiles only when they are needed, either directly, 
as part of a view of an image, or indirectly, in order to construct, by the process of 
inductive image generation, a tile that is needed directly; there are no stubs or place- 
holders in system memory for tiles that have not yet been referenced but might be needed 
in the future. Hence the system is able to produce results rapidly without requiring large 
amounts of digital memory in order to do so. These capabihties are particularly 
advantageous in hght, first, of the frequent need to undo editing decisions, and in light, 
second, of the frequency with which zoom operations are appHed in editing a master 
image by means of a proxy image-editing system. 

The following is an example of the advantages of the system in light of the 
25 need to undo editing operations. Imagine an image-editing session that has gone through 
24 transformation states and in which the user wishes to revert to the 16* state. Because 
no single state-specific tile is precious, the system was able to go from state 16 to state 24 
without worrying obsessively about conserving earlier work; it may well have purged 
some state-16 tiles from the tile cache in the process. But the chances are also good that 
30 there is a patchwork of various tiles from states 0 through 16 still remaining in the cache. 
The inductive image generation system is able to build on whatever combination of tiles 
remains in order to rapidly regenerate the state-16 view. 



4 

The following is an example of the advantages of the system in light of the 
frequency with which zoom operations are applied in editing a master image by means of 
a proxy image editing system. Imagine again an image-editing session that has gone 
through 24 transformation states. The user is likely to have zoomed and panned around 

5 the image from time to time. Suppose the user normally viewed a 1/32-resolution 

overview of the image, but that she zoomed into a particular point of interest in the 1 : 1 
fiill-resolution version of the image when in state 16 of the editing process. Suppose she 
is now in state 24, viewing the 1/32-resolution view, and that she then zooms into the a 
1 : 1 full-resolution view centered on the same point of interest that she viewed in detail in 

0 state 16. When, at state 16, the user looked at this area of interest in a fiill-resolution 1:1 
view, the system put the requisite fiill-resolution state- 16 tiles in the tile cache. Some of 
these tiles may have subsequently been purged. But some may remain. And the system, 
using inductive image generation is able to build on the remaining 1:1 state- 16 tiles to 
build the requisite 1 : 1 state-24 tiles without having to regress all the way back to the 

5 state-0 1:1 tiles that may be fetched from the image file. 

Other features and advantages of the present invention will be understood 
upon reading and understanding the detailed description of the preferred exemplary 
embodiments, foxmd hereinbelow in conjunction with reference to the drawings, in which 
0 like numerals represent like elements. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 Fig. 1 shows the components of the system in case it is embodied in a 

single computer. 

Fig. 2 shows the components of the system in case the main image editing 
functionality is running on a server computer and the user interface fimctionality is 
0 running on a client computer, with the two computers connected by a network such as the 
Internet. 

Fig. 3 shows a pixel-tile conversion schema permitting pixel addresses to 
be transformed into tile addresses and vice versa. 



Fig. 4 shows some of the principal submodules of the preferred 
embodiment's application module. 

Fig. 5 is a flow chart showing the flow of control of operations within the 
preferred embodiment's GetTile function. 

Fig. 6 is pseudo computer code illustrating six of the preferred 
embodiment's principal functions. 

Fig. 7 shows the calling structure of the functions shown in Fig. 6, and also 
shows the locations of the functions within the modules of Figure 4. 

Fig. 8 illustrates inductive image generation under the example of an 

image rotation. 



DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

The first subsection outlines the operations of the system when running on 
a single computer. There is then a subsection outlining the system when running in a 
networked chent-server environment. This is followed by two subsections describing in 
detail the process of inductive image generation with cached, state-specific image tiles. 
There is then a concluding section. 

1. SINGLE-COMPUTER SYSTEM 

Figure 1 illustrates the components of the system of the preferred 
embodiment in case it is running on a single computer 10. 

In a specific embodiment, the system runs on computers with CPUs 
manufactured by Intel and with operating systems manufactured by Microsoft, but the 
invention is suited for any general-purpose computer. 



A user of the computer uses user-interface devices such as keyboard 12 
and mouse 14 to issue instructions to the system, and sees the results of these instructions 
on video display 16. 



The computer contains a hard disk drive or other non-volatile storage 
device for digital data 1 8 containing at least one image file 20. 

The software of the system is partitioned into a user-interface (UI) module 
22 and an application module 24. In a specific embodiment, the software is written in the 
C++ and Java programming languages, but it could be written in any general-purpose 
computer-programming language. 

The UI module 22 takes in inputs from UI devices such as keyboard 12 
and mouse 14 and translates these inputs into file-selection, editing and viewing 
instructions. The latter instructions are then encoded and communicated to application 
module 24. 

Application module 24 carries out file-selection, viewing and editing 
operations. It does so with the assistance of viewing database 25, transformation database 
26, and transformation functions 28. The contents of viewing database 25 and 
transformation database 26 change in response to image-editing and viewing instructions 
received from UI module 22. The set 28 of viewing transformations is fixed and invariant 
during any single image-editing session. 

At any one time, a single image file, such as image file 20, is displayed. 

The image in the file is displayed as it appears after a sequence of edit 
operations, and after a viewing transformation, detailed instructions for which are 
encoded in viewing data 25 and in transformation state hst 26, respectively. 

Viewing data 25 include the resolution, the offset and the extent of the 
current view. Resolution is defined here to be a linear extent, height or width, of the 
current view as a ratio to the corresponding linear extent of the archival file. So, for 
example, a view of resolution 1/2 has half the height and half the width of the archival 
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file. Offset is measured horizontally and vertically, in pixels, from the top left of a fiiU- 
content image of the given resolution. The pixel indicated by the given offset will be at 
the top left of the displayed image. Extent is measured horizontally and vertically, in 
pixels, from the top left of the displayed image. 

The resolution of the current view may have resulted either directly or 
indirectly from the user's viewing decision. Systems will often be constructed so that the 
initial view is the highest-resolution view which fits within a given computer-screen 
display window and gives an overview of the fiill scene in the archival image. There may 
be additional constraints, such as that downsampled images are available only in even 
powers of two. Thus a very high-resolution archival hnage may need to be downsampled 
to a resolution of, say, 1/64 in order to fit within the viewing window. A lower-resolution 
archival image may be downsampled to, say, a resolution of only 1/4. Subsequently, the 
user may issue exphcit instructions to zoom in by a factor of two on the displayed image, 
thus explicitly requesting a 1/32-resolution image in the first case and a 1/2-resolution 
image in the second case. 

Transformation data are encoded in transformation state list 26. The state 
list is a sequentially-numbered list of transformations to be applied to the archival file, 
with the numerical sequence anchored at number 0. Each state in the sequence is 
characterized by a pointer to a set of transformation ftinctions and by a set of 
transformation data further characterizing the transformation fimctions. For example, the 
particular example illustrated at 28 in Figure 1 is one in which seven families of 
transformations can be appHed to the image. These are: 

The anchor transformation 30, Fetch image data from the file; 
32, Rotate the image; 

34, apply a Selective Color operation to the image; 
36, apply an HLS (Hue, Lightness, Saturation) color operation to the 
image; 

38, apply an Unsharp Mask operation to the image; 
40, apply an Airbrush operation to the image; 
42, Erase a Defect in the image. 



The edited image is constructed by applying a sequence of such 
transformations to a file image. 

The first transformation applied, at anchor state 0, is always to fetch the 
image fi-om the file, operation 30. 

Subsequent operations can be any ones of the remaining transformations, 
in any order. Operations can be applied more than once. The exact manner of their 
application is indicated by the transformation data stored in association with each node in 
the state hst. For example, following the anchor image-fetching operation at state 0 of 
Figure 1, the subsequent three operations are: 

1, Rotate; 

2, HLS; 

3, Rotate (again). 

In this example, the transformation data associated with the three states 

might be: 

1, Rotate 30 degrees clockwise; 

2, Lightness increase by 20 percent; 

3, Rotate 11 degrees counterclockwise. 

The transformation data associated with anchor state 0 identifies the image 
file and describes characteristics of it and of the methods to be used for fetching data fi-om 
it; some image files contain only the full-resolution image; other image files supplement 
the fiill-resolution image, in the same file or in linked files, with precomputed 
downsampled versions of it. In the former case, any downsampled image requested at 
state 0 has to be constructed fi-om the archival fiill-resolution file image. In the latter case, 
downsampled images of some resolutions are obtainable directly from the file; 
downsampled images of other resolutions can be constructed from file images closer in 
resolution to the target image than is the fuUresolution image. 
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Viewing dataset 25 and transformation state list 26 together contain all the 
information needed by application module 24 to construct the currently-requested image 
view. 

Status bit 27 indicates whether or not a view with the characteristics 
encoded by viewing and transformation datasets 25 and 26 has been constructed. When 
application module 24 receives new transformation or viewing instructions from UI 
module 22, it updates the viewing and transformation databases 25 and 26 to conform to 
the just-received instructions and then resets status bit 27 to value 0 in order to indicate 
that the view currently displayed on screen 16 is now inconsistent with the instructions m 
databases 25 and 26. AppHcation module 24 then constructs the requisite view, as 
specified by the two databases, and then, finally sets the status bit, thus indicating that the 
view now constructed is the one specified in the databases. 

A view is constructed by building or assembling a set of image tiles, 
placing them in a tile cache, and locking them down there until they have been copied 
into the system's video RAM 46. hnage tiles are specific to each state; a transformation is 
carried out by building a new set of image tiles for the transformed state, each tile 
carrying the identity of its state. These new tiles are then added to the tile cache 44 in 
which they then supplement earher image tiles for earlier states. However, the cache is of 
limited size. Hence it is occasionally necessary to purge old tiles, not locked down for the 
current view, from it. Thus some tiles that have been created in the history of an editing 
session may be missing from the cache. 

The preferred embodiment requires an image-tihng schema sufficiently 
well defined so that given the identifier of a tile the identifiers of aU pixels contained in it 
can be computed, and so that given the identifier of a pixel the identifier of the containing 
tile can be computed. A particularly simple example of such a scheme, used in the 
specific embodiment described here, is one in which all image tiles are rectangles or 
squares of the same dimensions, such as 128 columns and 128 rows, and in which image 
tiling obeys the following fiiU set of rules: 

(1) all tiles are of dimension n by n; 

(2) the origin of each image is at the upper left; 
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(3) the coordinates of the pixel at the origin are (x=0, y=0); 

(3) pixels are identified by their column, x, measured as a 0-anchored 

count to the right of the origin pixel, and by their row, y, 
measured as a 0-anchored count down firom the origin 
pixel; 

(4) the image is partitioned into a square grid of tiles in such a manner that 

the pixel at the top left of the tile at the top left is the origin 
pixel; 

(5) tiles are identified by their column, x, measured as a 0-anchored count 

to the right of the origin tile, and by their row, y, measured 
as a 0-anchored count down from the origin tile; 

(6) n = 128. 

This particular image-tiling scheme of (l)-(5) above is illustrated, for the case n = 4, in 
Figure 3. 

Given these particular rules, it follows that if the width of the image is not 
an even multiple of the standard width of a tile, then tiles at the far right of the image will 
have less than standard width. Similarly, tiles at the bottom of the image may have less 
than standard height. These effects can be seen in Figure 3. Rule (1) above, specifying 
identical dunensions for all tiles, can be amended to reflect these exceptions, or the tiles 
at the right and bottom of the image may be always padded out to the standard dimension 
with transparent pixels. It makes little difference which approach is chosen. 

Given rules (1) through (6) above and the specification of an image, a tile 
is uniquely identified by its row and column pair, (x, y). But the tile cache in the prefeixed 
embodiment intermingles tiles fi-om different images, differing fi-om the archival file 
image in their resolution and edit state. Hence it requires four numbers to specify the 
image-portion contained within an image tile: 

(1) the state of the image tile; 

(2) the scale of the image; 

(3) the column (x value) of the image tile; 

(4) the row (y value) of the image tile. 
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For example, suppose we are given the state list 28 of Figure 1, with 
specific transformations as described above: 

1, Rotate 30 degrees clockwise; 

2, Lightness increase by 20 percent; 

3, Rotate 11 degrees counterclockwise. 

Then the tile (state=0, scale=l/2, x=0, y=0) is the top left tile from a 1/2 
resolution view of the image as it originally was, with no transformations appUed to it. 
And the tile (state=3, scale=l/4, x=0, y=0) is the top left tile in a 1/4 resolution view of 
the image as it appears after the three transformations (rotate +30 degrees, hghten 20 
percent, rotate -11 degrees) listed above. 

Image generation is done inductively in the preferred embodiment in order 
to take advantage of the fact that the tile cache 44 may already contain a set of supplier 
tiles from states that are close predecessors of the current state s. The application module 
24 computes from its viewing data 25 the identifying characteristics (state, scale, x, y) of 
the set of tiles it needs to assemble in order to generate the current view. For each 
required tile, the application looks in tile cache 44 to see if the tile is present there. If so, it 
locks the tile in place, to prevent it from being purged until the view is complete. If not, it 
computes the set of supplier tiles required from the antecedent state s-1 (with the same 
given scale) in order to build the currently-needed tile. It looks for these suppUer tiles in 
the tile cache 44, and if they are present, apphes the state-s transformation to them in 
order to generate the required tile. If the supplier tiles are not present, the preferred 
embodiment recurs backward, looking for the supphers of the suppliers, in order to build 
them, and so forth. This recursive process is guaranteed to terminate, since the eventual 
supplier set of any tile can always be found, in the worst case, at state 0 in the file itself 

Eventually, using the process of inductive image generation, the 
application module 24 succeeds in assembling, in tile cache 44, all the tiles needed for the 
current view. These tiles are then all present in the cache, and locked against any tile- 
purge attempt. The application module then copies these tiles, with the appropriate offsets 
so they fill up the image grid, into video RAM 46, whereupon they become immediately 
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visible on computer display 16. Finally, the application module then unlocks the just- 
copied cached tiles, so that the digital memory they consume in the tile cache is 
recoverable if needed for new tiles later. 

This completes the overview of the system when running on a single 
computer. The subsequent subsection outlines the system when running on a server 
computer and a client computer, connected by a network. 
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2. CLIENT-SERVER SYSTEM 

Figure 2 shows how the system can be partitioned between a server 
computer 50 and a cHent computer 70, communicating over a network such as the 
Internet 60. The server then retains the basic image-editing computational tasks of the 
system, and the chent handles the user-interface and display tasks. 

In Figure 2, server computer 50 has retained from the single computer 
system 10 the hard disk drive 18, including archival image file 20, the RAM tile cache 44, 
the transformation state hst 26, the set of transformations 28, the viewing data 25, the 
status bit 27, and the application module 24. Server computer 50 has acquired a 
messaging module 52, capable of transferring and receiving messages over Internet 60 
encoded into the Internet transmission protocol language TCP/IP and perhaps also 
encoded into the World Wide Web's messaging protocol HTTP. 

The server's messaging module 52 receives from the client computer 70 
editing, undo and viewing instructions. An editing instruction is an instruction to advance 
the editing state 26 to a new state characterized by a specific editing operation 28 and 
associated editing-parameter data transmitted by the client. An undo instruction is an 
instruction to revert the state to an earlier state in state list 26. A viewing instruction is an 
instruction to change the scale, offset, or extent specified in viewing data set 25. The 
resulting requisite image tiles are then the tiles that will render the image edited according 
to the instructions in state list 26 and viewed according to the viewing parameters 
encoded in viewing data store 25. 

The client computer 70 in the client-server system acquires the keyboard 
12, mouse 22, and user-interface software module 22 of the single-computer system 10. It 
also has the single-computer system's video display 16 and video RAM 46. Client 
computer 70 also has its own messaging module 72, its own tile cache 76, and its own 
application module 74. The chent's application module 74 handles some of the functions, 
particularly the copying of tiles from chent tile cache 45 into video RAM 16, that were 
handled by the integrated system lO's application module 24. 
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The path a tile travels on its way to being displayed is now as follows. The 
server computer's application module 24 uses the process of inductive image generation, 
using the RAM tile cache 44, viewing data 25 and the transformation state list 26 and 
transformations 28 to assemble in tile cache 44 the requisite set of tiles for the current 
image view. The tiles are then passed by server apphcation module 24 to server 
messaging module 52 where they are encoded into the appropriate packets and sent across 
Internet 60 to the client computer's messaging module 72, whereupon the client 
computer's edit module 74 decodes them and places them in the client computer's own 
tile cache 76. The client computer's own edit module then copies the newly-received tiles 
from the tile cache 76 into video RAM 46, whereupon they become immediately visible 
on video display 16. 

The tile cache of chent computer 70 must be big enough to hold at least 
one tile. It is advantageous to make it much bigger than this. Client systems can be 
designed with minimal, moderate or extreme sophistication about tile caching. In the 
former case, the client system copies each tile from its cache 76 into video RAM 46 on 
receipt, and purges each tile from the cache after it has been copied to video RAM. In the 
latter case, the client system 70 retains the state and scale information of all tiles in its 
own tile cache, retains awareness of the current state of the system, and retains a copy of 
viewing database 25, so it has the same awareness of the properties of the current view as 
does server system 50. In this case the client system has a caching system fully 
comparable to that of the server; the chent, in response to a user-initiated view change or 
undo operation, can look in its own cache for the required tiles before requesting them 
from the server. 

3. INDUCTIVE IMAGE GENERATION: DESCRIPTION 

The following is an outline of the three functional components that are key 
to the inductive image generation process. This outline is followed by a more-detailed 
description of the inductive image-generation process. 

In the preferred embodiment, there is a function, called 
FindGeneratingRegion in this document, which, given the generating transformation 
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associated with a state, is capable of answering the following question for any tile in any 
view of the image: 

Ql : What is the generating region for the tile? I.e., what region of the 
prior-state image at this resolution is such that apphcation of this state's 
transformation operation to it will generate the requisite tile? 

There is also a function, called FindTilesIntersecting here, which is capable of answering 
the following question for any region in any view: 

Q2: What are the tiles that intersect the region? 

Finally, there is a function, called BuildTile here, which, given the 
existence of all state s-1 supplier tiles required to build a given state s tile under the 
transformation associated with state s, is capable of applying the transformation to the 
state s-1 tiles in order to build the state-s tile. 

It follows that by starting with a tile of a given state, answering Ql for it, 
thus obtaining its generating region, then passing the identity of this region back to the 
prior state and answering Q2 for it, the system can identify the set of tiles of the prior 
state minimally necessary to generate the currently-needed tile. The system, having 
identified these prerequisite tiles, may or may not find them in its tile cache, but, because 
of the constructibility of all state-0 tiles firom the image file, the system is guaranteed that 
by iterating this process it will, eventually, find a set of tiles, cached or in the image file, 
to which it can apply function BuildTile to eventually build the required tile in the current 
state. 

The following is a more detailed description of the inductive image 
generation process. 

Figures 4 through 8 illustrate the inductive image generation process, as 
carried out by the application module 24 of the preferred embodiment. 
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Figure 4 illustrates six modules within the application module 24. Shown 
in Figure 4 is the upper-level module 94 and five modules used in the tile-assembly 
process to assemble the set of image tiles conforming to the current viewing instructions 
25 and the current state list 26. 

The tile-assembly process begins when a procedure in the upper-level 
module 94 calls a procedure in the tile-assembly module 95. The latter procedure then 
calls into one of three specific tile-acquisition modules: tile-building module 96 
constructs a tile m state s by applying the state-s image transformation to tiles in state s-1; 
cache-handUng module 97 extracts a tile from tile cache 44; file-handling module 98 
extracts a state-0 tile from the image file. 

With one important exception, control flows in a single direction, from the 
tile-assembly module 95 to the specific tile-acquisition modules 96, 97 and 98. The 
exception is for the flow of control between tile-assembly module 95 and tile-building 
module 96. Suppose that the only way to get a given tile in a given state is to build it from 
tiles in the prior state. In that case, control flows from the tile-assembly module 95 to the 
tile-building module 96 as shown by arrow 87. But the requisite tiles from the prior state 
also need to be assembled. Hence control will flow back into the tile-assembly module 
95, as shown by arrow 88, in order to assemble the prior-state tiles. 

All four tile-acquisition modules 95, 96, 97 and 98 have access to a pixel- 
to-tile (and vice-versa) franslation module 99. The incorporation within the preferred 
embodiment of a well-defined pixel-tile conversion schema, as in Figure 3, means that 
tiles can be requested either directly, by their (state, scale, tile-x, tile-y) addresses, or 
indirectly, by the (state, scale, pixel-x, pixel-y) addresses of pixels they enclose. The 
preferred embodiment makes use of the dual-modality of the addressability of tiles. For 
example, the specific embodiment described here is capable of fetching tiles explicitly, by 
specification of their own addresses or implicitly, by specification of a region of image 
pixels. Function GetTile, shown at 109 in Figures 5, 6 and 7 fetches a tile specified by its 
explicit address. Function GetTiles, shown at 101 in Figures 6 and 7, fetches all tiles that 
intersect a specified region, with the latter specified by the pixel addresses of its 
boundary. 
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Figures 5, 6 and 7 show details of the operations within the tile-acquisition 
modules of Figure 4. 

Figure 5 shows the flow-of-control for the explicit-address tile-acquisition 
function, GetTile. This and other functions of the preferred embodiment have been given 
exphcit names in order to permit their relationships to be shown in Figure 6 by pseudo 
computer code. The flow-of-control of one function, GetTile, is illustrated by both a flow 
chart, Figure 5, and by pseudo code. Figure 6b. Details of the operations of other 
functions are shown by the more-compact pseudo code notation only. The pseudo code 
illustrates the flow of control. The preferred embodiment may be embodied in computers 
that achieve the same flow of control by running computer code in which functions have 
other names and other organizations than in Figure 6. In particular, the functions and 
structures of the computer code can be organized into objects as is done in object-oriented 
computer languages such as C++, or programming tasks can be organized into threads as 
is possible on multi-threaded computer operating systems. In these cases, the fimctional 
organization differs from the functional organization shown in the pseudo code of Figure 
6, but the flow of control imphed by the functional organization remains the same as the 
flow of control imphed by Figure 6. 

Figure 6 includes pseudo code for six functions. The identifying numbers 
used henceforth for these functions are the line numbers of their first hnes in Figure 6. 
These six flmctions reference other functions for which no pseudo code is shown here. 
These functions are assigned identifying numbers that do not correspond to line numbers 
in Figure 6. 

Figure 7 shows the dependency relationships among the functions 
illusti-ated by pseudo code in Figure 6, and shows the location of these functions within 
the tile-acquisition modules of Figure 4. (Figure 7 does not show the order in which 
functions are called, but only their dependency hierarchy; i.e., what calls what.) 

In the remaining description of a specific embodiment a tile address or tile 
ID is the identifying quadruplet (state, scale, x, y). A "tile" is a stiiictiire including the 
appropriate pixel bitmap or "tile raster," in addition to a header containing the tile ID 
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quadruplet (state, scale, x, y). A consequence of these conventions is that exchanging a 
tile address for the corresponding tile does not entail loss of the address. 

The following description of the GetTile process can be followed with 
reference to either the flowchart Figure 5 or the pseudo code Figure 6b. 

Function GetTile 109 is passed a tile address. It first ascertains, at 1 1 1, 
whether the tile with the given address is in tile cache 44. If so, it gets the tile from the 
cache at 1 12 via a call to function GetTileFromCache 164. If the tile is not in the cache, 
fiinction GetTile ascertains, at 114, whether the edit process is in the anchor state 0. If so, 
it gets the tile from the image file at 1 19 via a call to fimction GetTileFromFile 193. If the 
edit process is not in anchor state 0, the requisite tile has to be built from prior-state 
suppher tiles, and GetTile builds the tile from the prior state at 1 16 via a call to fimction 
BuildTileFromPriorState 124. 

Having obtained the tile, fimction GetTile calls, at 120, fimction "Process" 
133 which adds the tile to the cache (if it is not already there) by a call 136 to CacheTile 
151, and then locks the tile in the cache by a call 137 to LockTile 172 so that it cannot be 
purged by subsequent calls to CacheTile. 

Function GetTile 109 gets a tile specified by its explicit tile address. 
Function GetTiles 101, in confrast, gets all the tiles that intersect a region specified by 
pixel addresses. The incorporation within the preferred embodiment of a well-defined 
pixel-tile conversion schema, as in Figure 3, means that the preferred embodiment is able 
to carry out procedures such as FindAUTileslntersecting 191 which is passed a region 
identified by pixel addresses (and also a state and a scale), and which then returns the list 
of all (state, scale, x, y) tile addresses of all tiles that intersect the region. Procedure 
GetTiles 101 calls FindAUTileslntersecting at 102 and then calls GetTile at 105 for every 
tile address in the hst. Every tile obtained by GetTile 109 is then locked into the cache via 
a call at 120 to Process 133, so all tiles obtained by GetTiles end up locked into the tile 
cache. 

As is illustrated in Figure 7, functions GetTiles and GetTile are the two 
entiy points into the tile-acquisition modules. These two entiy points are called by 
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functions in the upper-level module 94 of the appKcation module 24. Hence, as can be 
seen with reference Figures 1 and 2, an image request coming into the appUcation module 
24 from UI module 22 can specify the required image portion either as an explicit tile (in 
which case GetTile is called) or as a (perhaps rectangular) image region specified by its 
bounding pixel vertices (in which case GetTiles is called). Hence, in the case of the 
client-server application of Figure 2, the client's application module 74 may either have 
the ability or lack the ability to compute the explicit tile addresses of tiles corresponding 
to an image region. If it lacks this ability, it must request image tiles indirectly, by asking 
for a region of pixels. If it has this ability, it can request tiles directly. 

The core of the preferred embodiment is invoked when procedure GetTile 
finds, at step 116, that it needs to build the required tile from tiles in the prior state via a 
call to BuildTilesFromPriorState 124. The ensuing process is illustrated in Figure 6c and 
in Figure 8. 

Figure 8a illustrates a 25-tile state-s-1 image 200 that then undergoes a 30- 
degree clockwise rotation that transfonns it into the 49-tile state-s image 210 of Figure 
8b. Because all images in the specific embodiment must be horizontal rectangles, the 
rotated state-s image is bigger than the state-s-1 image. The rotated image is padded out 
with empty tiles such as (x=0, y=0) in order to keep the rectangle of the image horizontal. 

Suppose there is a call to GetTile(s, scale, x=4, y=2), requesting tile 220 of 
Figure 8b. (The value of "scale" is irrelevant here, so long as it is the same in both image 
200 and 210.) Suppose further that function GetTile could not find tile (s, scale, x=4, y=2) 
in the tile cache, so that it called at 116 procedure BuildTileFromPriorState 124, passing 
it the tile address (s, scale, x=4, y=2). 

The first thing BuildTileFromPriorState does at 126 is to pass the tile 
address (s, scale, x=4, y=2) to function 188, FindGeneratingRegion. The job of the latter 
function is to compute the pixel-coordinate vertices of the region in the prior state from 
which the current tile is generated. In Figure 8, tile 220 comprises a portion of the 
forehead of the rotated face. By reversing the rotation, as in Figure 8c, procedure 
FindGeneratingRegion derives the diamond-shaped area 230 in state s-1 that is the 
source, under tiie rotation transformation leading to state s, of the area 220. At step 126 
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the pixel-coordinate vertices of area 230 are returned to BuildTileFromPriorState as 
"region." 



At step 127 of BuildTileFromPriorState, the just-computed region is 
5 passed, along with the identifier of the prior state, state s-1, and the scale, to the already- 
described tile-assembly procedure GetTiles 101. Procedure GetTiles then concludes, by a 
call at 103 to FindTilesIntersecting 191 that the four tiles indicated by hatching in Figure 
8c, comprising collectively region 240, are the state-s-1 tiles that intersect region 230. 
These are the requisite state-s-1 suppher tiles for generation of the state-s tile 220. 
1 0 Procedure GetTiles then, at 105, calls procedure GetTile again for each of these state-s-1 
tiles. 



Procedure GetTile is guaranteed to succeed in constructing each of the 
four requisite state-s-1 tiles in region 240 of Figure 8c. How it does so depends on the 

1 5 detailed characteristics of the state list 26 and of the tile cache 44 at the time GetTile is 
called. GetTile may find a state-s-1 tile in the cache. It may have to call 
BuildTileFromPriorState again to build a state-s-1 tile from state-s-2 tiles. It may have to 
go all the way back to state 0 and get suppher tiles from the file. It may do some 
combination of the above. What is known, fi-om the way the fimctions are constructed, is 

2 0 that GetTile will get each of the four tiles in region 240, and that it will place each one, 
locked, in the tile cache. The result is that when, at 106, procedure GetTiles returns back 
to calling procedure BuildTilefromPriorState at 128 all four requisite state s-1 supplier 
tiles 240 will be locked down in the tile cache. 



25 At line 128, finally, procedure BuildTileFromPriorState calls procedure 

BuildTile in order to build the (state=s, scale, x=4, y=2) tile 220 from supplier tiles 240. 
Because of the prior calls at 126 and 127 to FindGeneratingRegion and GetTiles, the 
supplier tiles 240 required for the call 128 to BuildTile 160 are now guaranteed to be in 
the tile cache. 



Function BuildTileFromPriorState obtains at 128, on return firom BuildTile 
160, the sought-after state-s tile 220. 
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Then, at line 129, procedure BuildTileFromPriorState calls procedure 
UnlockTiles 142 in order to unlock (make available for possible future purging from the 
tile cache) the four state-s-1 tiles 240 that were used to generate the just-constructed state 
s tile 220. 

Then, at 130, procedure BuildTileFromPriorState returns back to calling 
procedure GetTile at line 11 6, returning the just-generated tile. 

Then, at 120, GetTile passes the tile to procedure "Process" 134. The latter 
function looks, at 135, to see if the tile is already in the tile cache. In the case of this 
example tile 220 is not aheady in the tile cache, so procedure Process passes tile 220 at 
136 to procedure CacheTile 151. Procedure CacheTile checks at 153 to see if there is 
room in the cache to store tile 220. If not, CacheTile calls at 154 to PurgeTilesFromCache 
168 which removes some unlocked tiles from the cache using criteria which may vary 
depending on the embodiment. Finally, at 155, there is known to be sufficient room in the 
cache for tile 220, and tile 220 is cached via a call to AddTileToCache 170. 

Then, at 156, procedure CacheTile returns back to procedure Process at 
137, whereupon the newly-constructed tile 220 is locked in the cache. It will remain 
locked until it has been dehvered to the video RAM 46, in the case of the single computer 
application of Figure 1, or to the client computer's tile cache 45, in the case of the client- 
server system of Figure 2. 

4. INDUCTIVE IMAGE GENERATION: SYSTEM EXTENSIBILITY 

In Figures 1 and 2, the list 28 of available image transformations contains 
seven transformations. It is a characteristic of the preferred embodiment that it is very 
easy for an engineer maintaining an embodiment of this invention to extend the system by 
adding an eighth transformation to the hst. 

Six functions were illustrated by pseudo code in Figure 6, and these 
functions called 11 more functions. All 17 functions are shown m Figure 7. Of these 17 
functions only two 
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BuildTile 160, and 
FindGeneratingRegion 188 



depend on the set 28 of image transformations. Only these two functions need to be 
rewritten when a transformation in set 28 is edited or added. Other functions of the 
system depend on the image state, but only as a means of identifying image tiles; hence 
these other functions do not change when an existing image transformation is modified or 
when a new image transformation is added. This special status of the two functions 
BuildTile 160 and FindGeneratingRegion 188 has been indicated in Figure 7 by drawing 
their enclosing boxes in bold and by enclosing them in a transformation-dependent 
submodule 93 of tile-building module 96. 

In many cases, in fact, the addition of a new transformation function to the 
set 28 will require modifying only one function, BuildTile 160. 

Many image transformations share the same procedure for finding the 
prior-state generating region of a given tile. In particular, there are many "stationary" 
transformations with the property that pixel (x, y) in state s is constructed from the pixel 
at the same location, (x, y), in the prior state s-1. For example, all the standard image tone 
and color change transformations such as contrast and brightness transformations, are 
stationary transformations. For all such stationary transformations, the generating region 
of any tile with address (x, y) is exactly the rectangular pixel region it occupies, and the 
set of supplier tiles for the tile is exactly the tile with the identical (x, y) address in the 
prior state. Hence a new stationary transformation can be added to the set of 
transformations with minimal changes to the FindlntersectingRegion procedure. 

In fact, it is advantageous to organize the flow of control so that in case a 
transformation is stationary the control logic in procedure BuildTilesFromPriorState 124 
substitutes a single call to a procedure GetSameXyTileFromPriorState for the pair of calls 
at 126 and 127 to FindGeneratingRegion 188 and GetTiles 101. If the flow of control is 
organized in this manner, it is only necessary to modify a single procedure, BuildTile, 
when adding a new stationary transformation. 



5. CONCLUSIONS 
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In conclusion, the preferred embodiments provide a system and method of 
editing digital images that are fast and flexible, permitting full undo and zoom and pan 
capabilities, on computer systems of limited memory, and which is suitable for operation 
5 on a networked computer system, in which the principal computational tasks for image- 
editing are performed by a server computer, and in which the user interface and image 
display are handled by a networked client computer. Systems embodying the invention 
are able to provide these features because of the inductive image generation technology, 
using cached state-specific image tiles. While the above is a complete description of the 

1 0 specific embodiments of the invention, various alternatives, modifications, and 

equivalents may be used. In particular, the word "tile" has been used in the discussion of 
specific embodiments to mean a horizontal rectangle of the image, but the invention may 
be embodied in systems in which the tiles are not horizontally oriented or are not 
rectangles. In addition, the word "tile" has been used in the discussion of specific 

1 5 embodiments to mean one of a pliu:ality of identically shaped image portions, but the 
invention may be embodied in systems in which the tiles are not identically shaped. In 
addition, the word "tile" has been used in the discussion of specific embodiments to mean 
one of a set of image portions covering an image exhaustively and also minimally, 
without overlap, but the invention may be embodied in systems in which some of the tiles 

20 do overlap, so that they do not cover the image minimally. Therefore, the above 

description should not be taken as limiting the scope of the invention which is defined by 
the appended claims, and in which the word "tile" is intended to mean "an image 
portion". 



